Xuming Hu

May

Thinking Economically: A Hierarchical Framework for Adaptive-Complexity Reasoning in LLMs

May 31, 2026
Via arXiv

Learning from Fine-Grained Visual Discrepancies: Mitigating Multimodal Hallucinations via In-Context Visual Contrastive Optimization

May 29, 2026
Via arXiv

NestedKV: Nested Memory Routing for Long-Context KV Cache Compression

May 26, 2026
Via arXiv

SAMark: A Self-Anchored Text Watermarking with Paragraph-Level Paraphrase Robustness

May 25, 2026
Via arXiv

When Looking Is Not Enough: Visual Attention Structure Reveals Hallucination in MLLMs

May 12, 2026
Via arXiv

Where to Focus: Query-Modulated Multimodal Keyframe Selection for Long Video Understanding

Apr 19, 2026
Via arXiv

Correct Prediction, Wrong Steps? Consensus Reasoning Knowledge Graph for Robust Chain-of-Thought Synthesis

Apr 15, 2026
Via arXiv

Decoding by Perturbation: Mitigating MLLM Hallucinations via Dynamic Textual Perturbation

Apr 14, 2026
Via arXiv

Visual Late Chunking: An Empirical Study of Contextual Chunking for Efficient Visual Document Retrieval

Apr 11, 2026
Via arXiv

StreamMeCo: Long-Term Agent Memory Compression for Efficient Streaming Video Understanding

Apr 10, 2026
Via arXiv